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ABSTRACT 


The  problem  of  selecting  the  normal  population  with  the  largest 
mean  from  several  populations  with  common  unknown  variance  is  considered. 
A  new  simple  two-stage  elimination  type  procedure  is  proposed  which 
guarantees  the  same  probability  requirement  using  the  indifference-zone 
approach  as  does  the  two-stage  non-elimination  type  procedure  of 
Bechhofer,  Dunnett  and  Sobel  (1954).  The  observations  in  the  first 
stage  are  used  not  only  to  estimate  the  common  unknown  vai ranee  but 
also  to  screen  out  non-contending  populations.  The  proposed  procedure 
has  some  similarities  with  the  procedure  of  Tamhane  (1975),  but  uses 
a  new  design  criterion  and  a  sharper  lower  bound  on  the  probability 
of  a  correct  selection.  A  table  of  constants  necessary  to  implement 
the  procedure  is  provided.  The  results  of  a  Monte  Carlo  study  of 
comparison  of  the  elimination  type  procedure  with  the  non-elimination 


type  procedure  are  also  given.  The  results  strongly  indicate  that  the 


proposed  elimination  type  procedure  performs  much  better  than  the  non-/ 
elimination  type  procedure  of  Bechhofer,  Dunnett  and  Sobel  (1954)  /  . ' /V- > 


in  terms  of  the  expected  total  sample  size. 
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1 .  INTRODUCTION 

Following  the  work  of  Bechhofer  (1954),  considerable  amount 
of  research  has  been  carried  out  for  the  problem  of  selecting  the 
normal  population  with  the  largest  mean  from  k(k  >  2)  normal 
populations  using  the  so-called  Indifference-zone  approach. 

For  the  case  of  common  known  variance,  Alam  (1970)  proposed 
a  two-stage  elimination  type  procedure.  In  which  he  uses  Gupta's 
(1956,  1965)  subset  selection  procedure  In  the  first  stage  to 
screen  out  non-contending  populations  and  Bechhofer 's  (1954) 
Indifference-zone  approach  to  all  populations  retained  In  the 
second  stage.  However,  most  of  his  work  was  limited  to  the 
special  case  of  k»2  populations.  Recently,  Tamhane  and  Bechhofer 
(1977,  1979)  studies  In  detail  a  two-stage  elimination  type 
procedure  using  a  minimax  design  criterion,  and  they  found  It  more 
efficient  than  the  single-stage  procedure  of  Bechhofer  (1954).  For 
relevant  results  regarding  various  two-stage  procedures,  see  Gupta 
and  Miescke  (1982a). 

For  the  case  of  cormon  unknown  variance.  It  Is  known  that  there 
does  not  exist  a  single-stage  procedure  which  guarantees  a  "mlnlmuln 
probability  requirement".  If  one  adopts  the  Indifference-zone  approach. 
Bechhofer,  Dunnett  and  Sobel  (1954)  (henceforth  referred  to  as  BDS) 
proposed  a  two-stage  non-elimination  type  procedure.  In  which  the 
observations  In  the  first  stage  are  only  used  to  obtain  an  estimate 
of  the  common  unknown  variance.  Tamhane  (1976)  proposed  a  three- 
stage  elimination  type  procedure,  and  carried  out  a  Monte  Carlo 
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study  to  compare  h1s  procedure  with  the  non-elimination  type  proce¬ 
dure  of  BDS  (1954).  However,  as  pointed  out  by  Hochberg  and  Marcus 
(1981),  Tamhane's  argument  is  not  quite  correct.  Tamhane  (1975) 
also  considered  a  two-stage  elimination  type  procedure  for  the  case 
of  common  unknown  variance.  However,  in  his  Monte  Carlo  sampling 
results,  the  performance  of  his  two-stage  elimination  type  procedure 
was  found  to  be  inferior  to  that  of  the  non-elimination  type  procedure 
of  BDS  (1954). 

In  this  paper,  we  propose  and  study  a  two-stage  elimination 
type  procedure  for  the  case  of  conwon  unknown  variance.  In  Section  2, 
a  precise  formulation  of  the  problem  and  a  two-stage  elimination  type 
procedure  are  given.  Section  3  discusses  some  possible  design  criteria, 
and  the  tables  to  implement  the  procedure  are  given.  Section  4 
gives  the  results  of  the  Monte  Carlo  study  in  which  we  compare  the 
two-stage  elimination  type  procedure  with  the  non-elimination  type 
procedure  of  BDS  (1954).  The  results  show  that  the  two-stage  elimin¬ 
ation  type  procedures  perform  much  better  than  the  non-elimination 
type  procedure  of  BDS  (1954). 

It  is  worth  noting  that  Paulson  (1964)  studies  a  sequential 
elimination  type  procedure  for  the  case  of  common  unknown  variance, 
and  Hochberg  and  Marcus  (1981)  considered  three-stage  elimination 
type  procedures  for  the  case  of  unequal  and  unknown  variances. 

Recently,  Gupta  and  Miescke  (1982b) and  Miescke  (1980)  have 
studied  two-stage  selection  procedures  when  we  have  a  control  popu¬ 
lation.  For  recent  reviews  of  the  literature,  see  Miescke  (1982). 


The  reader  is  referred  to  the  relevant  book  by  Gupta  and  Panchapakesan 
(1979)  for  a  complete  overview. 


2.  A  TWO- STAGE  PROCEDURE 

Let  ir^,...,Tr|^(k>2)  be  k  normal  populations  with  unknown  means 

2 

and  a  common  unknown  variance  o  .  The  ordered  values  of 
are  denoted  by  •  •iP[|(]*  No  prior  knowledge  is  assumed 

concerning  the  correct  pairing  between  and  (l<i.j<k). 

The  goal  of  the  experimenter  is  to  select  the  "best"  population, 
i.e.,  the  one  associated  with  the  largest  mean  Following  the 

indifference-zone  approach  by  Bechhofer  (1954),  the  experimenter, 
prior  to  the  experiment,  specifies  two  constants  6*>0  and  P*(l/k<P*<l) 
which  are  incorporated  into  a  probability  requirement 


Pq{CS}  >  P*  for  all  et  n(6*) 


(2.1 


where  CS  denotes  the  event  of  selecting  the  best  population  and 


n(4*)  =  (y-j,...  ,y|^,o)  I  "  *^[|(-1]  i  4*}. 

The  two-stage  non-elimination  type  procedure  of  BDS  (1954) 
and  Paulson's  (1964)  sequential  elimination  type  procedure  satisfy 
the  probability  requirement  (2.1).  However,  Tamhane's  (1976)  three- 
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stage  procedure  needs  to  be  re-examined  for  the  probability  require¬ 
ment  (2.1)  because  of  the  error  as  pointed  out  previously. 

The  following  two-stage  elimination  type  procedure  Pcan  be 
viewed  as  a  generalization  of  the  non-elimination  type  procedure 
of  BDS  (1954). 


Stage  1 :  Take  n^  (n^i2)  independent  observations  (j=l,...,n^) 

from  each  (i=l . k)  and  compute  the  sample  means 


n 


N  ^  *  1  (i®l.---*k)  and  the  pooled  sample  variance 

j=l 

k  *^1 

=  I  I  (X.,-  -  X,PV/k(n,  -  1).  Determine  a  subset  I  of 
i=l  j=l  ^  ‘ 


{l,...,k}  where 


I  »  (i'lXS^^  >  max  jS”  -  (05/^7-  «*)*). 
l<j<k  ^  ' 

the  symbol  a^  denoting  the  positive  part  of  a,  i.e.,  a^  =  max(a,0); 
here  d  is  a  positive  constant  defiend  in  (2.2)  below. 

(a)  If  I  has  only  one  element,  then  stop  sampling  and  assert 

that  the  population  associated  with  max  is  the  best. 

l^<k  J 

(b)  If  I  has  more  than  one  element,  then  proceed  to  the  second 
stage. 

Stage  2:  Take  N-n^  additional  observations  X^j  from  each  Tr^(it  I) 
where 


N  =  maxinp  [(hS/6*)^]}, 


the  symbol  [y]  denoting  the  smallest  integer  equal  to  or  greater 
than  y;  here  h  is  a  positive  constant  defined  in  (2.2)  below.  Then 
compute  the  overall  sample  means 

N 

I  X../N{ifel) 

^  j=l 

and  assert  that  the  population  associated  with  max  )(.  is  the  best. 

i€  I  ^ 

Remark  2.1 ;  This  procedure  is  essentially  of  the  same  type  as  the 
one  in  Tamhane  (1975)  even  though  we  have  a  different  type  of  screening 
procedure.  However,  it  is  Important  to  note  that  we  are  using  an 
improved  lower  bound  on  P.{CS}  to  determine  the  constands  d  and  h 
so  as  to  satisfy  the  probability  requirement  (2.1). 

Remark  2.2:  Note  that  in  the  limiting  case  d=<»,  this  procedure 
reduces  to  that  of  BDS  (1954). 

In  the  sequel,  «(•)  denotes  the  c.d.f.  of  the  standard  normal 
distribution  and  G^(«)  denotes  the  c.d.f.  of  (x^(v)/v)^^^  with 
X  (v)  denoting  the  chi-square  random  variable  with  v=k(ni-l)  degrees 
of  freedom. 

Theorem  2.1.  For  the  procedure  P,  we  have  the  following  inequalities: 

inf  P  {CS}  >  /  {/  $*^"^(x+dw)d$(x)  /  (y+hw)d4’(y)}dG  (w) 

n(s*)  S.  “  0  -«  -» 

00  OO  s  00  00 

>  /  /  <t>^''(x+dw)d'j(x)dG  (w)/  f  ♦*'”\x+hw)d*(x)(ffi  (w). 
0  -«>  ''  0  -«  '* 


Proof.  Let  and  let  and  X^.  denote  respectively 

the  first  stage  sample  mean  and  the  overall  sample  mean,  1=1, 

Then,  from  Tamhane  and  Bechhofer  (1979),  we  have  the  following 
Inequalities  for  all  (up. . . ,H|^,o) fe  £j(6*) ; 

yCS}  >  L^\]]  -  (dS/v^  -  &*)^,  -^(i)’  . 

/ni  ^  (^nT  .  XT 

*  Pju,.  •  V,-  . 

0i“oKi  o  a  i“a 


where  U^.  =  (^(ij  "  ^(k)  '^i  ~  ^^(i)  ”  ^(k)  ^ 

i=l ,. . . ,k-l .  Since  i4f  >  hs/6*  and  — *  (^1^  ■  — ^*)  >  d^ 

*“  a  Ki  a  a  —  o 

for  i=l,...,k-1,  it  follows  that  for  all  e€fi{6*). 


PefCS)  i P,(U,  <  a|.  V,  <h|,  i.l....,k-l). 


Now,  conditioning  on  S/a  =  w  and  applying  Slepian's  inequality  as  in 

Tamhane  and  Bechhofer  (1979),  we  obtain  the  first  inequality  in  the 

theorem.  We  note  that  the  lower  bound  on  PgiCS}  by  the  first  inequa- 

_  00 

lity  can  be  written  as  E{Fj(W)F^(W)}  where  Fj(w)  =  /  (x+dw)d4'(x) 

00 

"  k  1 

and  Fj^(w)  =  /  «  (x+hw)d#(x).  Since  F^  and  F^  are  non-decreasing 


in  w,  it  follows  from  Tchebysheff 's  inequality  (Hardy,  Littlewood 
and  Polya  (1934),  Theorem  43)  that 
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E{F^j(W)Fj^lW))  >  E{F^(W)}E{F^(W)} 

which  gives  the  second  Inequality  In  the  theorem. 

Now,  the  constants  d  and  h  in  the  procedure  p  can  be  chosen  by 
setting  either  of  the  two  lower  bounds  in  Theorem  2.1  to  be  P*  so 
as  to  satisfy  the  probability  requirement  (2.1). 

Even  though  the  second  lower  bound  is  less  sharp  than  the  first, 
it  is  much  easier  to  evaluate.  Furthermore,  the  functions  in  the 
second  lower  bound  are  the  c.d.f. 's  at  equicoordinate  point  of  a 
multivariate  central  t-distribution,  which  have  been  studied  and 
tabluated  by  Dunnett  (1955),  Gupta  and  Sobel  (1957)  and  Krishna! ah 
and  Armitage  (1966).  Thus,  we  used  the  second  lower  bound  to  compare 
our  two-stage  elimination  type  procedure  P  with  the  two-stage  non¬ 
elimination  type  procedure  of  BDS  (1954).  We  state  this  in  the  next 
result. 

Corollary  2.1 .  If  d  >  0  and  n  >  0  are  chosen  to  satisfy 

OO  00  ,  _  00  00 

/  /  /■^x+dw)d«(x)dG^(w)  /  /  r'^x+hw)d»(x)d6^(w)  =  P*,  (2.2) 

0  0 

then  the  procedure  P  satisfies  the  probability  requirement  (2.1). 

We  note  that  the  lower  bound  on  PgICS}  we  are  using  is  sharper 
than  the  one  in  Tamhane  (1975),  as  can  be  seen  in  Section  11  of 
Tamhane  and  Bechhofer  (1977). 

3.  A  DESIGN  CRITERION 

The  constants  n-j ,  d  and  h  in  the  procedure  p  can  be  considered 


s: 


9 


2 

as  design  constants.  As  in  the  case  of  known  a  ,  one  might  try  to 

minimize  the  maximum  of  the  expected  total  sample  size.  However, 

it  can  be  easily  seen  that  the  maximum  of  the  expected  total  sample 

size  w.r.t.  y-j,...,U|^  depends  on  unknown  a  and  increases  indefinitely 
2 

as  o'  increases.  Thus,  as  an  alternative,  we  might  ask  for  the  pro¬ 
cedure  P  to  have  a  high  probability  of  including  the  best  in  the  first 
stage. 

More  precisely,  we  ask  for  the  procedure  P  to  satisfy  the  following 
requirement  in  addition  to  (2.1):  For  all  n(6*), 

Pg{including  the  best  in  the  first  stage}  ^  P^  (3,1) 

where  P|(P*<P|<1)  is  a  preassigned  constant. 

It  can  be  easily  shown  that,  for  the  procedure  , 

inf  P-{including  the  best  in  the  first  stage) 
n(6*)  ^ 

=  /  /  $*^"^(x+dw)d«(x)dG^(w). 

0 

Therefore,  if  d  >  0  and  h  >  0  are  chosen  to  satisfy 

/  /  ♦''■^x+dw)d$(x)dG^(w)  =  P^ 

0  *00 

(3.2) 

CO  00 

^  /  ♦'^■^(x+hw)d«(x)d6^(w)  =  P*/Pf. 
then  the  procedure  P  satisfies  (2.1)  and  (3.1). 


Note  that  the  additional  requirement  (3.1)  for  the  procedure  P 
makes  the  choice  of  d  and  h  unique  once  n-j  is  given,  and  it  can  be 

regarded  as  a  design  criterion  for  d  and  h. 

1 

From  now  on,  we  will  denote  by  procedure  4’  satisfying 

(3.1)  with  =  0.99  (0.975).  Table  3.1  and  Table  3.2  give  the  constants 
d  and  h  for  p-j  and  P2,  respectively,  for  selected  values  of  n^ ,  k 

and  P*.  In  making  these  tables,  the  integrals  on  the  right  side  of 

(3.2)  were  computed  by  using  the  result  of  Hartley  (1943-46)  and 
Gauss-Hermite  quadrature  ^o™Lila  as  done  in  Gupta  and  Panchapakesan 
(1982). 

Another  simple  way  to  make  the  choice  of  d  and  h  unique  is  to 
assume  that  d=h.  In  such  a  case,  (2.2)  reduced  to 

00  00 

j  J  $'^‘^(x+dw)d$(x)dG  (w)  =  (3.3) 

0  -«  '' 

Note  that  this  is  equivalent  to  taking  P^  =  in  (3.2).  In  fact, 
this  was  the  choice  made  in  Tamhane's  (1975)  Monte  Carlo  study  even 
though  Tamhane  used  a  lower  bound  different  from  the  one  used  in 

(3.3) .  We  will  call  by  the  procedure  P  satisfying  (3.3).  The 
constants  d(=h)  in  procedure  3'"®  given  in  Table  3.3  for  selected 
values  of  n^ ,  k  and  P*. 

Also,  we  give  the  constnats  h  in  the  procedure  P(BDS)  of  BDS 
(1954)  in  Table  3.4.  Some  of  these  entries  can  also  be  found  in 
Dunnett  (1955),  Gupta  and  Sobel  (1957)  and  Krishna! ah  and  Armitage 
(1966).  The  values  seem  to  agree  well  in  most  of  the  common  cases 


n 


up  to  the  second  decimal  point  except  a  very  few  cases. 

Finally,  we  remark  that  we,  by  no  means,  claim  that  (3.2)  and 
(3.3)are  the  only  possible  design  criteria.  For  example,  the  expected 
number,  say  S,  of  populations  to  be  sampled  at  the  second  stage  can 
be  used  in  devising  a  design  criterion.  Furthermore,  following  the 
arguments  in  Section  6  of  Tamhane  and  Bechhofer  (1977),  it  can  be 
shown  that 


sup  E-{S}  =  k  ;  /  {$'^'^(x+dw)  -  *’^"^(x-dw)}d$(x)dG  (w)  (3.4) 

-  0  -CO 


Hence,  if  one  wishes  to  use  S  in  devising  a  design  criterion,  the  above  equation 
might  be  useful. 


TABLE  3.1 :  Constants  d  and  h  for  the  procedure 
(a)  constants  d  Pij"  =  0.99 


3 

5 

1 

5.064  ^ 

4.140 

5.026 

1 

4.277 

4.953 

4.330 

4.896 

4.368 

4.791 

4.502 

[  .  , 

4.795 

4.596 

i 

9 

10 

15 

1 

20 

25 

3.660 

1 

1 

3.614 

3.490 

3.435 

3.404 

3.917  1 

1 

3.881 

3.782 

3.737 

3.712 

4.045 

1 

4.016 

3.935 

3.897  ! 

1 

3.876 

4.131 

4.106 

4.038 

1 

4.006 

3.988 

4.370 

4.356 

4.316 

4.297 

j 

4.286 

4.502 

4.492 

4.463 

4.450 

^  1 

I  4.442  ! 

I 

I 

TABLE  3.T:  (Continued) 

P*  =  0.90 

9  10  15  20  25 

2  2.291  2.068  2.005  1.974  1.964  1.936  1.923  1.916 

3  2.677  2.473  2.413  2.384  2.375  2.348  2.335  2.328 

2.847  2.672  2.620  2.594  2.586  2.562  2.551  2.544 

5  2.953  2.802  2.756  2.733  2.726  2.704  2.694  2.689 

10  3.225  3.134  3.105  3.090  3.085  3.071  3.065  3.061 

15  3.368  3.301  3.279  3.269  3.265  3.255  3.250  3.247 


9  10  15  20  25 

2  3.446  2.834  2.698  2.636  2.617  2.562  2.537  2.522 

3  3.616  3.161  3.046  2.993  2.976  2.927  2.905  2.892 

3.666  3.315  3.220  3.174  3.156  3.118  3.099  3.088 

5  3.702  3.416  3.334  3.295  3.282  3.246  3.229  3.219 

10  3.837  3.682  3.634  3.610  3.602  3.580  3.569  3.563 

15  3.934  3.824  3.789  3.771  3.766  3.749  3.741  3.737 
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TABLE  3.2;  Constants  d  and  h  for  the  procedure 
(a)  constants  d  =  0.975 


X 

3 

5 

7 

9 

10 

15 

20 

25 

2 

4.152 

3.284 

3.087 

3.000 

2.972 

2.897 

2.863 

2.844 

3 

4.177 

3.553 

3.396 

3.324 

3.301 

3.237 

3.208 

3.191 

4 

4.151 

3.675 

3.547 

3.488 

3.468 

3.415 

3.390 

3.375 

5 

4.139 

3.755 

3.648 

3.597 

3.580 

3.534 

3.512 

3.500 

10 

4.178 

3.978 

3.916 

3.886 

3.876 

3.848 

3.835 

3.828 

15 

4.243 

4.104 

4.060 

4.038 

4.031 

4.010 

4.001 

3.995 

(b)  constants  h  P*  =  0.90 


3 

5 

7 

9 

10 

15 

20 

25 

2 

2.503 

2.229 

2.153 

2.117 

2.106 

2.073 

2.058 

2.049 

3 

2.862 

2.619 

2.550 

2.516 

2.505 

2.474 

2.460 

2.452 

4 

3.014 

2.810 

2.750 

2.720 

2.711 

2.684 

2.671 

2.664 

5 

3.108 

2.934 

2.881 

2.855 

2.847 

2.823 

2.812 

2.805 

10 

3.356 

3.253 

3.220 

3.204 

3.198 

3.183 

3.176 

3.172 

15 

3.491 

3.415 

3.391 

3.379 

3.375 

3.363 

3.358 

3.355 

14 


=  0.95 

9  10  15  20  25 

2  4.120  3.260  3.066  2.981  2.954  2.880  2.846  2.827 

3  4.149  3.533  3.378  3.307  3.284  3.221  3.192  3.176 

4.126  3.656  3.530  3.472  3.453  3.400  3.395  3.361 

5  4.116  3.737  3.631  3.581  3.565  3.519  3.498  3.485 

10  4.160  3.962  3.902  3.872  3.862  3.835  3.822  3.814 

15  4.227  4.089  4.046  4.024  4.017  3.997  3.987  3.982 

r  the  procedure 
.90 

9  10  15  20  25 

2  3.070  2.610  2.500  2.449  2.432  2.386  2.365  2.353 

3  3.329  2.963  2.867  2.821  2.806  2.765  2.746  2.735 

3.422  3.132  3.050  3.011  2.998  2.963  2.946  2.936 

5  3.482  3.241  3.171  3.137  3.126  3.094  3.080  3.071 

10  3.663  3.528  3.485  3.464  3.457  3.438  3.429  3.423 

15  3.774  3.677  3.646  3.631  3.626  3.611  3.604  3.600 


3 

5 

4.136 

: 

3.272 

4.163 

3.543 

4.139 

3.665 

4.128 

3.746 

4.169 

3.970 

4.235 

4.097 

3 

5 

2.173 

1.976 

2.573 

2.389 

2.752 

2.593 

2.864 

2.726 

3.149 

3.064 

3,297 

3.234 

10 

15 

20 

25 

2.963 

2.889 

2.855 

2.835 

3.293 

3.229 

3.200 

3.183 

3.461 

3.407 

3.382 

1 

1 

3.368 

3.573 

3.526 

1 

3.505 

3.493 

3.869 

3.842 

3.828 

3.821 

4.024 

4.004 

3.994 

3.988 

h  for  P 

(BDS) 

0.90 

10 

15 

20 

25 

1.881 

1.856 

1.844 

1.838 

2.298 

2.273 

2.262 

2.255 

2.513 

2.490 

2.480 

2.474 

2.654 

2.635 

2.625 

2.620 

3.019 

3.006 

3.000 

2.996 

3.200 

3,191 

3.186 

3.184 

TABLE  3.4:  (Continued) 


P*  =  0.95 


"l 

k 

3 

5 

7 

9 

10 

15 

20 

25 

2 

3.110 

2.634 

2.522 

2.469 

2.453 

2.406 

2.384 

2.372 

3 

3.360 

2.985 

2.886 

2.840 

2.825 

2.783 

2.763 

2.752 

4 

3.448 

3.152 

3.069 

3.029 

3.016 

2.979 

2.963 

2.953 

5 

3.506 

3.260 

3.189 

3.154 

3.143 

3.111 

3.096 

3.088 

10 

3.682 

3.544 

3.502 

3.480 

3.473 

3.454 

3.444 

3.439 

15 

3.792 

3.694 

3.662 

3.646 

3.641 

3.627 

3.620 

3.616 

4,  RESULTS  OF  MONTE  CARLO  COMPARISON 
The  elimination  type  procedure  P  in  Section  2  can,  in  fact,  be 
regarded  as  a  class  of  procedures,  since  there  are  many  ways  of  choosing 
the  constants  d  and  h  satisfying  (2.2).  In  particular,  the  procedures 
R| ,  Pg*  *3  Section  3  can  be  regarded  as  various  versions  of  the 

elimination  type  procedure  P.  Hence,  a  comparison  of  the  procedures 
Pj ,  p2,  Pj  with  the  non-elimination  type  procedure  P(BDS)  can  shed 
light  on  the  effect  of  elimination  in  the  first  stage. 

This  section  gives  the  results  of  Monte  Carlo  comparisons  of 
the  elimination  type  procedures  Pj,  Pg,  P^  with  the  non-elimination 
type  procedure  P(BDS)  in  terms  of  the  expected  total  sample  sizes. 
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We  will  use  respectively  T  and  S  as  generic  notations  for  the 
total  sample  size  and  the  number  of  populations  to  be  sampled  in  the 
second  stage  in  using  the  elimination  type  procedure.  Then,  the 
total  sample  size  of  the  elimination  type  procedure  is  given  by 


T  =  kn^  +  S(N-n^ ) 


(4.1) 


where  S  is  an  integer-valued  random  variable  which  can  take  values 
1  to  k,  inclusive.  Now  the  total  sample  size  of  the  non-elimination 
type  procedure  P(BDS)  is  given  by  T=kN.  Expressions  for  the  expected 
total  sample  sizes  can  be  found  in  BDS  (1954)  and  Tamhane  (1975). 

For  the  elimination  type  procedures  in  this  simulation  study, 
the  estimates  of  the  expected  total  sample  sizes  E-{T)  were  computed 
at  the  so-called  equal  means  configuration  (EMC), 
and  at  the  least  favorable  configuration  (LFC) 

Note  that  Eg(T}  of  the  procedure  P(BDS)  does  not  depend  on 

The  Monte  Carlo  comparisons  were  carried  out  for  k=5,10,15, 
P*=0.90,  0.95  and  6*=0.5,1.0.  The  choice  of  n^ ,  the  first  sample 
size  for  each  population,  is  optional  and  we  considered  the  procedures 
with  n^=5,10,15,20,25  to  get  some  idea  about  the  relationship  between 
Eq{T}  and  n^.  For  each  (k,P*,6*,n.j )-combination,  we  considered  the 
cases  of  a=l,2,3  and  1000  simulations  were  carried  out.  In  each 
simulation,  k  independent  standard  normal  random  variables  were 
generated  by  the  random  number  generator  of  H.  Rubin  and  C.  Hinkle 


at  Purdue  University.  In  this  paper,  we  have  reported  the  results 
only  for  P*=0.90  and  6*=1.0  and  the  results  for  other  cases  indicate 
similar  conclusions  and  are  available  upon  request. 

The  estimates  of  the  expected  total  sample  sizes  of  the  procedures 
,  p2,  Pj  and  P(BDS)  are  given  in  Table  4.1  along  with  their  respec¬ 
tive  estimated  standard  errors.  Throughout  all  the  cases  considered, 
the  elimination  type  procedures  P-j,  P2,  P^  have  substantially  smaller 
E{T}  values  than  corresponding  E{T}  values  of  the  non-elimination  type 
procedure  P(BDS),  except  when  the  procedures  stop  at  the  first  stage 
(these  cases  are  denoted  by  a  symbol  t  in  Table  4.1  and  Table  4.2). 

It  can  also  be  observed  from  Table  4.1  thcit  the  reduction  in  E{T} 
by  using  the  elimination  type  procedure  increases  with  k  for  fixed 
n^i  and  a. 

Remark  4.1 :  The  superiority  of  the  two-stage  elimination  type  pro¬ 
cedure  to  the  two-stage  non-elimination  type  procedure  is  what  one 
might  expect  intuitively;  however,  this  was  not  revealed  in  Tamhane's 
(1975)  Monte  Carlo  study.  We  think  that  it  was  because  he  used  a 
lower  bound  on  P„{CS}  less  sharp  than  the  one  in  this  paper,  as  pointed 
out  in  Remark  2.1. 

From  Table  4.1,  we  also  observe  that,  as  n.|  increases,  E{T}  values 
decrease  at  the  beginning  and  then  increase.  Such  a  relationship 
between  n-j  and  E{T}  for  the  procedure  P(BDS)  has  been  given  in 
BDS  (1954).  A  glance  at  the  results  in  Table  A.lshows  what  n^  should 
be  for  the  range  of  a-values  considered. 


TABLE  4.1. 

Monte  Carlo  estimates  of  the  expected  total  sample  size 
The  numbers  in  parentheses  are  the  estimated  standard  errors, 
(a)  k  =  5,  «*  =  1.0,  P*  =  0.90 
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TABLE  4.1 .(Continued) 

Monte  Carlo  estimates  of  the  total  expected  sample  size 
(b)  k  =  10,  6*  =  1.0,  P*  =  0.90 


_ ■  I 

1130  ISiSl 
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To  summarize  the  comparison*  we  computed  the  estimated  relative 
savings  RSAVE  expressed  In  percentages  In  using  an  elimination  type 
procedure  P  over  the  non-elimination  type  procedure  P(BDS),  I.e., 

RSAVE  {P  |P(BDS)}  =  x  100(S!) 

These  values  are  based  on  Table  4.1,  and  are  given  In  Table  4.2. 

It  Is  observed  from  Table  4.2  that  there  Is  a  more  pronounced 

Improvement  of  the  elimination  type  procedure  over  the  procedure 

P(B0S)  at  the  LFC  than  at  the  EMC.  In  fact,  it  can  shown  that,  for 
2 

fixed  0  ,  E-{T}  of  the  elimination  type  procedure  Is  maximized  at  the 
0 

EMC  when  £=(u^ • ,U|^,o)  vaires  over  the  whole  parameter  space.  Thus, 
even  greater  savings  can  be  obtained  by  using  the  elimination  type 
procedure  than  those  indicated  by  the  values  In  Table  4.2.  Finally, 

It  should  be  pointed  out  that  the  superiority  of  one  procedure  over 
others  among  the  procedures  P^ ,  P2*  *^3  quite  clear  from  Table 

4.2.  Thus,  It  Is  recommended  that  the  choice  of  d  and  h  satisfying 
(2.2)  may  be  made  to  depend  on  the  experimenter's  other  requirements 
such  as  those  described  In  Section  3.  In  this  respect,  we  have  computed 
the  expected  numbers  of  populations  to  be  sampled  In  the  second  stage, 
E{S},  as  by-products  of  a  simulation  study,  which  are  given  In  Table  4.3. 
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TABLE  4.2. 

Estimated  relative  savings  expressed  in  percentages  in  using  procedures 
Pj,  p2,  Pj  over  the  procedure  P(BDS). 

(a)  k  =  5,  6*  =  1.0,  P*  =  0.90 


I  LFC 

1  EMC _ _ _ J 

"l 

a 

^1 

‘’2 

*^3 
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*^2 

‘^3 

1 

35.1 
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TABLE  4.3. 

Monte  Carlo  estimates  of  the  expected  number  of  populations  to  be 
sampled  in  the  second  stage.  The  numbers  in  parentheses  are  the  esti¬ 
mated  standard  errors. 


(a)  k  =  5,  6*  =  1.0,  P*  =  0.90 


TABLE  4.3.  (Continued) 

Monte  Carlo  estimates  of  the  expected  number  of  populations  to  be  sampled 
in  the  second  stage. 

(b)  k  =  10,  6*  =  1.0,  P*  =  0.90 


VOO 
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TABLE  4.3.  (Continued) 

Monte  Carlo  estimates  of  the  expected  number  of  populations  to  be  sampled 
in  the  second  stage. 

(c)  k  =  15,  6*  =  1.0,  P*  =  0.90 
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The  problem  of  selecting  the  normal  population  with  the  largest  mean  from 
several  populations  with  coninon  unknown  variance  is  considered.  A  new  simple 
two-stage  elimination  type  procedure  is  proposed  which  guarantees  the  same 
probability  requirement  using  the  indifference-zone  approach  as  does  the  tvro- 
stage  non-elimination  type  procedure  of  Bechhofer,  Dunnett  and  Sobel  (1954). 
The  observations  in  the  first  stage  are  used  not  only  to  estimate  the  conrion 
unknown  variance  but  also  to  screen  out  non-contending  populations.  The 
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proposed  procedure  has  some  similarities  witri  the  procedure  of  TeMhene 
(1975),  but  uses  a  new  design  criterion  and  a  sharper  lower  bound  on 
the  probability  of  a  correct  selection.  A  table  of  consUnts  necesury 
to  Implement  the  procedure  Is  provided.  The  results  of  a  Monte  Carlo 
study  of  comparison  of  the  elimination  type  procedure  with  the  non- 
elimination  type  procedure  are  also  given.  The  results  strongly  Indicate 
that  the  proposed  elimination  type  procedure  performs  much  better  than 
the  non-elimination  type  procedure  of  Bechhofer,  Dunnett  and  Sobel 
(1954)  In  terms  of  the  expected  total  sample  size. 
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